Rebrickable exploration

Mateusz Tabaszewski, Bartłomiej Pukacki

2023-04-14


Themes


In this section we will explore data regarding themes.

Distribution of Themed Set Releases

Cumulative Sum of Sets for Chosen Themes

The plot shows the cumulative sum for the number of sets for selected themes over the years. The chosen themes were based on their popularity, which was determined by the total number of produced sets. For this visualization, the top 5 themes with the highest number of produced sets were chosen.

As seen on the plot, the “Books” theme is the oldest among the presented themes and experienced slow growth until around the year 2010 when there was a sudden increase. Many of the presented themes experienced quick growth in the 2000s. “Technic” had the most sets created for it until relatively recently when the “Star Wars” theme’s explosive growth surpassed it. This illustrates the importance of the “Star Wars” brand for LEGO and how collaborating with a well-known brand can influence the company’s future.

Number of Produced Star Wars Sets

The previous visualization has shown the significance of the “Star Wars” brand and theme for the LEGO company. It would now be beneficial to take a closer and more thorough look at the growth of the number of sets for the “Star Wars” theme.

The plot displays the number of sets produced in a given year for the “Star Wars” theme. Additional dotted, horizontal lines were included to show the release dates for both the main prequel trilogy and the sequel trilogy. The original trilogy’s release dates were not shown since they predate the release of any “Star Wars” themed LEGO sets.

Regarding the general trend, it can be seen that the number of produced sets increased considerably over the years. However, recently, the number of produced “Star Wars” themed sets has been declining steeply. Interestingly, 3 out of the 6 shown movie releases (Attack of the Clones, Revenge of the Sith, The Force Awakens) coincided with local maxima for the number of sets produced. Additionally, the release of “Phantom Menace” coincided with the start of collaboration between LEGO and Star Wars in producing sets, with a significant spike visible within a year of the release. However, the release of “Rise of Skywalker” did not cause a significant increase in the number of produced themed sets, neither before nor after the premiere.


Themes ~ sets


Table of parent themes


Parts


Data

Number of Unique LEGO Blocks per Year

The plot shows the number of unique LEGO blocks produced in a given year. Unique parts in this case are defined as all LEGO parts that appear only in one set.

The number of unique LEGO blocks appears to have increased throughout the years, which makes sense, as with time more sets are produced in a year by the company. As such, more unique blocks may also be produced. However, the growth is not smooth, and there appear to be significant spikes throughout the years in the number of unique blocks produced.

Cumulative Sum of Rare LEGO Parts through Years

The plot shows the cumulative sum of unique LEGO blocks produced in a given year. Unique parts in this case are defined as all LEGO parts that appear only in one set.

The graph shows how the total number of unique blocks increased through the years. Interestingly enough, the growth appears to be about exponential in nature although not completely smooth, with significant increases in a given year being noticeable on the plot’s line.

Number of Rare LEGO Parts vs Average Number of Parts for a Theme

This interactive scatterplot shows the number of rare LEGO parts on the y-axis and the average number of parts in a set for a given theme. Meaning that each point is a theme with a given number of rare LEGO parts and given average number of parts for a set. The rare LEGO parts on this plot refer to LEGO parts/blocks that appear in only one of the sets across all themes. Using this plot, collectors can see which themes are most worth buying when it comes to getting the highest number of unique parts among non-unique LEGO parts.

Average Number of Parts per Set

The plot shows the average number of parts in a set for each given year.

The average number of parts in a set has increased significantly throughout the years, however, there have been significant spikes and fluctuations, as can be seen in the graph. For example, the average number of parts in a LEGO set was much higher in the year 1960 than in the years before or after.


Parts ~ themes


Themes with the Highest Amount of Unique LEGO Parts

Graph Explanation: The horizontal barchart shows the names of 20 themes with the highest number of unique LEGO parts ordered according to the decreasing number of unique LEGO parts. Unique parts in this case are defined as all LEGO parts that appear only in one set.

Graph Description/Intermediate Conclusions: Themes with highest number of unique parts include, among many others, themes like: “Duplo and Explore”, “Ninjago” and “Star Wars”. It appears that themes that could be considered to be visually “unique” are also more likely be chosen as a set with highest number of unique parts. Some themes may also end up gaining advantage in this category thanks to them having more sets overall, like the “Star Wars” theme.

Average and Maximum Number of Parts

The horizontal bar chart shows the average and maximum number of parts in a set for a given theme. The themes on the plot are ordered according to decreasing value of average number of parts for a set of that theme. The chosen themes in this case refer to themes with the highest average number of parts per set.

The horizontal bar chart shows the average number of parts for the 20 themes with the highest average number of parts in a set. The themes on the plot are ordered according to decreasing value of the average number of parts for a set of that theme.


Sets


Number of sets over years

The plot shows the total number of sets produced across all themes for each year. An exponential regression line has been added to show the observed trend of the growth of the number of produced sets in a year. The plot shows only the data up to 2022, as the information about the number of produced sets for 2023 is incomplete and could influence the shape of the regression line.

The increase appears to be exponential in nature, with the drop at the end caused by the fact that the data for 2023 only includes sets produced so far into the year. It’s likely that many more sets will still be produced until the end of the year. The exponential regression line predicts further growth in the number of produced sets for the year 2023.


Colors


In the next section we will explore the colors data.

Proportion of transparent colors